3 research outputs found

    Analyzing Storage System Workloads

    Get PDF
    Analysis of storage system workloads is important for a number of reasons. The analysis might be performed to understand the usage patterns of existing storage systems. It is very important for the architects to understand the usage patterns when designing and developing a new, or improving upon the existing design of a storage system. It is also important for a system administrator to understand the usage patterns when configuring and tuning a storage system. The analysis might also be performed to determine the relationship between any two given workloads. Before a decision is taken to pool storage resources to increase the throughput, there is need to establish whether the different workloads involved are correlated or not. Furthermore, the analysis of storage system workloads can be done to monitor the usage and to understand the storage requirements and behavior of system and application software. Another very important reason for analyzing storage system workloads, is the need to come up with correct workload models for storage system evaluation. For the evaluation, based on simulations or otherwise, to be reliable, one has to analyze, understand and correctly model the workloads. In our work we have developed a general tool, called ESSWA (Enterprize Storage System Workload Analyzer) for analyzing storage system workloads, which has a number of advantages over other storage system workload analyzers described in literature. Given a storage system workload in the form of an I/O trace file containing data for the workload parameters, ESSWA gives statistics of the data. From the statistics one can derive mathematical models in the form of probability distribution functions for the workload parameters. The statistics and mathematical models describe only the particular workload for which they are produced. This is because storage system workload characteristics are sensitive to the file system and buffer pool design and implementation, so that the results of any analysis are less broadly applicable. We experimented with ESSWA by analyzing storage system workloads represented by three sets of I/O traces at our disposal. Our results, among other things show that: I/O request sizes are influenced by the operating system in use; the start addresses of I/O requests are somewhat influenced by the application; and the exponential probability density function, which is often used in simulation of storage systems to generate inter-arrival times of I/O requests, is not the best model for that purpose in the workloads that we analyzed. We found the Weibull, lognormal and beta probability density functions to be better models

    A Storage System Workload Analyzer

    Get PDF
    The analysis of storage system workloads is important for a number of reasons. It is necessary to understand the usage patterns of secondary storage to enable architects to understand and build a new, or improve upon an existing storage system. It is also important for a storage administrator to understand the workload profile when configuring and tuning a system. Furthermore, the analysis of workloads is necessary to understand the storage requirements and behavior of application software. A different reason for workload analysis, is the need to come up with adequate models for performance evaluation. For the evaluation, based on simulation or otherwise, to be reliable one has to analyze, understand and adequately model the workloads. In this paper we describe a methodology to go about storage system workload analysis and illustrate it with a general tool we call the Enterprize Storage System Workload Analyzer or ESSWA. Although such analyzes are not new, we believe our proposals have a number of advantages over previous analyzers described in the literature. We experimented with ESSWA by analyzing workloads represented by three sets of publicly available workload traces

    Analyzing Storage System Workloads

    Get PDF
    Disk storage subsystems have not kept up the speed with processors. Processor performance has been increasing at a much higher rate than that of disk drives. Therefore, I/O subsystem has become a bottleneck in current computer systems. With this in mind the research community is looking into ways of improving the I/O subsystem. IBM and HP are among the organizations doing research and development of high performance storage systems also referred to as Enterprise Storage Systems (ESSs). Much of this effort goes into the evaluation of these systems for correctness and performance. For these evaluations, using simulations or otherwise, to be reliable, there is need to correctly understand and model the disk I/O workloads also known as I/O traffic or physical I/O workloads. Therefore in our work, we analyzed I/O workload traces to derive statistics which can be used as a guide in the (a) modelling of I/O workload and subsequent production of I/O workload representative of the actual I/O workload for evaluation and (b) optimization of the storage systems. Our results show that the distribution of inter-arrival times of I/O requests are heavy-tailed, and that the I/O request sizes are a function of the operating system
    corecore